Higher-Order Certification For Randomized Smoothing
Randomized smoothing is a recently proposed defense against adversarial attacks that has achieved state-of-the-art provable robustness against $\ell_2$ perturbations. A number of works have extended the guarantees to other metrics, such as $\ell_1$ or $\ell_\infty$, by using different smoothing measures. Although the current framework has been shown to yield near-optimal $\ell_p$ radii, the total safety region certified by the current framework can be arbitrarily small compared to the optimal. In this work, we propose a framework to improve the certified safety region for these smoothed classifiers without changing the underlying smoothing scheme. The theoretical contributions are as follows: 1) We generalize the certification for randomized smoothing by reformulating certified radius calculation as a nested optimization problem over a class of functions.
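For context, the first-order certificate that this framework generalizes can be sketched as follows. This is the standard Gaussian-smoothing $\ell_2$ bound (radius $\sigma \cdot \Phi^{-1}(p_A)$), written as a minimal, self-contained illustration; it is not the paper's higher-order method, and the inputs are illustrative.

```python
from statistics import NormalDist

def l2_certified_radius(p_a: float, sigma: float) -> float:
    """First-order randomized-smoothing certificate: if the top class has
    smoothed probability p_a > 0.5 under Gaussian noise N(0, sigma^2 I),
    the smoothed prediction is constant within an l2 ball of this radius."""
    if p_a <= 0.5:
        return 0.0  # no certificate when the top class is not a majority
    return sigma * NormalDist().inv_cdf(p_a)

# Example: p_a = 0.9, sigma = 0.5 -> radius = 0.5 * Phi^{-1}(0.9) ~= 0.64
print(round(l2_certified_radius(0.9, 0.5), 2))
```

The abstract's point is that this certificate describes only an $\ell_p$ ball, whereas the true safe region of the smoothed classifier can be much larger.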
The paper received positive scores (7, 7, 7, 6), and all the reviewers appreciated the paper for the following: (i) theoretical contributions
We thank all the reviewers for their time and effort in providing feedback. For clarity, we would like to reiterate the goal and motivation of the paper. We address the individual concerns below. We thank R3 for pointing out the typo. The approximated network achieved 97.17% test-set accuracy; on the other hand, one of our networks resulting from edge-popup achieved 97.53% test-set accuracy. We would again like to thank the reviewers for the positive reviews.
on both our theoretical contributions showing an equivalence between a notion of training speed and the Bayesian marginal likelihood
We thank the reviewers for their helpful feedback. We now address some concerns. We have replicated the DNN experiments (S4.2). We can derive this result using Jensen's inequality. 'I was not able to ascertain how the result of Theorem 2 is used in the text; I'd be happy if the authors could clarify.' 'I found that the transition to the neural networks remains a bit confusing.' 'how much the results support the marginal likelihood-based model selection hypothesis, or whether they should more
First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. This paper derives policy gradient algorithms for risk-sensitive MDPs for a particular criterion, CVaR - a recent and popular criterion. First, the authors derive gradients for the objective based on a Lagrangian relaxation of the constrained optimization. This naturally yields a policy gradient algorithm where the expected return that appears in the gradient is estimated from full trajectories (REINFORCE-like). They then propose a scheme to obtain incremental actor-critic versions, where the critic computes the value (and other quantities) of an augmented MDP convenient for gradient estimation.
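The REINFORCE-like step the review describes can be sketched as follows. This is an illustrative simplification of the trajectory-based CVaR gradient estimator (only the worst alpha-fraction of trajectories contributes, weighted relative to the empirical VaR level); the function name and array shapes are assumptions, and the paper's Lagrangian and actor-critic machinery is omitted.

```python
import numpy as np

def cvar_policy_gradient(returns, score_grads, alpha=0.1):
    """Monte Carlo estimate of a CVaR_alpha policy-gradient direction.

    returns:     (N,) array of total trajectory returns R_i
    score_grads: (N, d) array of per-trajectory score functions
                 sum_t grad log pi(a_t | s_t)
    Only trajectories in the lower alpha-tail (below the empirical VaR)
    contribute, each weighted by (R_i - VaR_alpha).
    """
    var_alpha = np.quantile(returns, alpha)   # empirical VaR level
    mask = returns <= var_alpha               # tail trajectories
    weights = (returns - var_alpha) * mask    # (R_i - VaR) on the tail, 0 elsewhere
    return (weights[:, None] * score_grads).mean(axis=0) / alpha
```

The incremental actor-critic versions the authors propose replace the full-trajectory return estimates here with values computed by a critic on an augmented MDP.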
Review for NeurIPS paper: Untangling tradeoffs between recurrence and self-attention in artificial neural networks
Additional Feedback: - Line 145, how can Theorem 1 be related to the early attention mechanism [1]? As the attention weights are computed adaptively, it is unlikely that they are uniform. MANNs learn to store relevant hidden states in a fixed-size memory, which seems to serve the same purpose as the relevancy screening mechanism. What is the advantage of the proposed method over MANNs? How are MANNs related to Theorem 2? - The paper neglects prior works that also aim to quantify gradient propagation in RNNs and attentive models [4,5].
Reviews: Integrating Bayesian and Discriminative Sparse Kernel Machines for Multi-class Active Learning
Originality: The combination of sampling in areas of 'greater interest' while adjusting to the underlying distribution appears in many active learning works, but the objective in (1) is novel, and approaching both in a unified framework is challenging. The lower bounding of the optimization problem is also new. Quality: The experimental results are very thorough and show the improvement of the proposed method over random sampling as well as several other baselines, and the exploration of the effect of tuning parameters and initial sample size is excellent. However, the theoretical contributions appear incomplete. The significant theoretical contribution is the (mislabelled) Theorem 2, and both the statement and proof of this are extremely informal.
Review for NeurIPS paper: Joint Contrastive Learning with Infinite Possibilities
Additional Feedback: I think it is too strong to claim that "we also theoretically unveil the certain important mechanisms that govern the behavior of JCL." The main theoretical tool in the proposed method is an application of Jensen's inequality. There is also a section (3.3) that discusses some very basic properties of the objective. To claim any of this as a significant "theoretical contribution" is too strong in my view. To me, the most interesting aspect of Fig. 2 is part (b).
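The Jensen step the review refers to is the standard concavity bound $\mathbb{E}[\log X] \le \log \mathbb{E}[X]$, which is what turns an expectation inside a logarithm into a tractable lower bound on a contrastive objective. A quick numerical check of the inequality (purely illustrative, not code from the paper):

```python
import math
import random

random.seed(0)
xs = [random.uniform(0.5, 2.0) for _ in range(100_000)]

log_of_mean = math.log(sum(xs) / len(xs))             # log E[X]
mean_of_log = sum(math.log(x) for x in xs) / len(xs)  # E[log X]

# Jensen's inequality for the concave log: E[log X] <= log E[X]
assert mean_of_log <= log_of_mean
```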
Reviews: Implicitly learning to reason in first-order logic
This paper is generally well written and clear, albeit targeting readers with formal backgrounds. The quality of the paper seems high in terms of its formal claims. The proposed mechanism is remarkably simple, making this an attractive approach. I really like the idea behind not making learning explicit (as opposed to rule induction, for example). I have three main concerns about this paper: - In general it is very close to Juba's 2012 work [1].
Reviews: Generalization Properties of Learning with Random Features
This is, in my opinion, an excellent paper and a significant theoretical contribution to understanding the role of the well-established random-feature trick in kernel methods. The authors prove that, for a wide range of optimization tasks in machine learning, random-feature-based methods provide algorithms giving results competitive (in terms of accuracy) with standard kernel methods using only \sqrt{n} random features (instead of a linear number; this provides scalability). To my knowledge, this is one of the first results where it is rigorously proven that for downstream applications (such as kernel ridge regression) one can use random-feature-based kernel methods with a relatively small number of random features (the whole point of using the random-feature approach is to use significantly fewer random features than the dimensionality of the data). So far most guarantees were of a point-wise flavor (there are several papers giving upper bounds on the number of random features needed to approximate the value of the kernel accurately for a given pair of feature vectors x and y, but it is not at all clear how these guarantees translate, for instance, to risk guarantees for downstream applications). The authors, however, miss one paper with very relevant results that would be worth comparing with theirs.